Unsupervised Discovery of Facial Events: Learning a Dynamic Vocabulary for Facial Analysis

نویسندگان

  • Fernando De la Torre
  • Feng Zhou
  • Jeffrey Cohn
چکیده

Automatic facial image analysis is a long standing research problem in computer vision. A key component in facial image analysis, largely conditioning the success of subsequent algorithms (e.g. facial expression recognition), is to define a vocabulary of possible dynamic facial events. To date, that vocabulary has come from the anatomically based Facial Action Coding System (FACS) or taxonomies derivative of FACS (e.g., EM-FACS and basic emotions). Each FACS action unit (AU) corresponds to the movements of one or more facial muscles. The aim of this paper is to discover a taxonomy of facial movements directly from video, without recourse to FACS or its derivative. Using unsupervised learning, we discover dynamic facial events directly from video of naturally occurring facial behavior (i.e., not posed) of multiple people. Several issues contribute to the challenge of this task. These include non-frontal pose, moderate to large out-of-plane head motion, large variability in the temporal scale of facial gestures, person variability and the exponential nature of possible facial action combinations. To address these problems, this paper proposes a novel temporal segmentation and multi-subject correspondence algorithm for matching expressions. Our method achieved good convergent validity with manual FACS annotation. To the best of our knowledge, this is the first attempt to learn a dynamic vocabulary of facial events from video.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised Discovery of Facial Events: Learning a Dynamic Vocabulary for Facial Analysis

Automatic facial image analysis is a long standing research problem in computer vision. A key component in facial image analysis, largely conditioning the success of subsequent algorithms (e.g. facial expression recognition), is to define a vocabulary of possible dynamic facial events. To date, that vocabulary has come from the anatomically based Facial Action Coding System (FACS) or taxonomies...

متن کامل

RI:Medium: Unsupervised and Weakly-Supervised Discovery of Facial Events

The face is one of the most powerful channels of nonverbal communication. Facial expression has been a focus of emotion research for over a hundred years [10]. It is central to several leading theories of emotion [17, 30, 47] and has been the focus of at times heated debate about issues in emotion science [18, 24, 43]. Facial expression figures prominently in research on almost every aspect of ...

متن کامل

A Nonlinear Grayscale Morphological and Unsupervised method for Human Facial Synthesis Based on an Example Image

Human facial generation of example image is used as a requirement for biometric applications for the purpose of identifying individuals. In this paper, face generation consists of three main steps. In the first step, detection of significant lines and edges of the example image are carried out using nonlinear grayscale morphology. Then, hair areas are identified from the face of sample. The fin...

متن کامل

RI:Medium: Unsupervised and Weakly-Supervised Discovery of Facial Events

The face is one of the most powerful channels of nonverbal communication. Facial expression has been a focus of emotion research for over a hundred years [11]. It is central to several leading theories of emotion [16, 28, 44] and has been the focus of at times heated debate about issues in emotion science [17, 23, 40]. Facial expression figures prominently in research on almost every aspect of ...

متن کامل

Hierarchical Unsupervised Learning of Facial Expression Categories

We consider the problem of unsupervised classification of temporal sequences of facial expressions in video. This problem arises in the design of an adaptive visual agent, which must be capable of identifying appropriate classes of visual events without supervision to effectively complete its tasks. We present a multilevel dynamic Bayesian network that learns the high-level dynamics of facial e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009